Overview
Brought to you by YData
Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 4277 |
| Missing cells | 1117 |
| Missing cells (%) | 2.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.7 MiB |
| Average record size in memory | 406.9 B |
Variable types
| Text | 3 |
|---|---|
| Categorical | 2 |
| Boolean | 2 |
| Numeric | 6 |
FoodCourt is highly overall correlated with VRDeck | High correlation |
VRDeck is highly overall correlated with FoodCourt | High correlation |
VIP is highly imbalanced (87.2%) | Imbalance |
HomePlanet has 87 (2.0%) missing values | Missing |
CryoSleep has 93 (2.2%) missing values | Missing |
Cabin has 100 (2.3%) missing values | Missing |
Destination has 92 (2.2%) missing values | Missing |
Age has 91 (2.1%) missing values | Missing |
VIP has 93 (2.2%) missing values | Missing |
RoomService has 82 (1.9%) missing values | Missing |
FoodCourt has 106 (2.5%) missing values | Missing |
ShoppingMall has 98 (2.3%) missing values | Missing |
Spa has 101 (2.4%) missing values | Missing |
VRDeck has 80 (1.9%) missing values | Missing |
Name has 94 (2.2%) missing values | Missing |
PassengerId has unique values | Unique |
Age has 82 (1.9%) zeros | Zeros |
RoomService has 2726 (63.7%) zeros | Zeros |
FoodCourt has 2690 (62.9%) zeros | Zeros |
ShoppingMall has 2744 (64.2%) zeros | Zeros |
Spa has 2611 (61.0%) zeros | Zeros |
VRDeck has 2757 (64.5%) zeros | Zeros |
Reproduction
| Analysis started | 2024-12-04 18:51:02.966191 |
|---|---|
| Analysis finished | 2024-12-04 18:51:05.780867 |
| Duration | 2.81 seconds |
| Software version | ydata-profiling vv4.11.0 |
| Download configuration | config.json |
Variables
PassengerId
Text
Unique 
| Distinct | 4277 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 234.0 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 4277 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0013_01 |
|---|---|
| 2nd row | 0018_01 |
| 3rd row | 0019_01 |
| 4th row | 0021_01 |
| 5th row | 0023_01 |
| Value | Count | Frequency (%) |
| 0027_01 | 1 | < 0.1% |
| 9277_01 | 1 | < 0.1% |
| 0013_01 | 1 | < 0.1% |
| 0018_01 | 1 | < 0.1% |
| 0019_01 | 1 | < 0.1% |
| 9240_01 | 1 | < 0.1% |
| 9243_01 | 1 | < 0.1% |
| 9245_01 | 1 | < 0.1% |
| 9249_01 | 1 | < 0.1% |
| 9255_01 | 1 | < 0.1% |
| Other values (4267) | 4267 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5982 | |
| 1 | 4806 | |
| _ | 4277 | |
| 2 | 2434 | |
| 3 | 2072 | 6.9% |
| 5 | 1829 | 6.1% |
| 4 | 1808 | 6.0% |
| 7 | 1786 | 6.0% |
| 6 | 1777 | 5.9% |
| 8 | 1755 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29939 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5982 | |
| 1 | 4806 | |
| _ | 4277 | |
| 2 | 2434 | |
| 3 | 2072 | 6.9% |
| 5 | 1829 | 6.1% |
| 4 | 1808 | 6.0% |
| 7 | 1786 | 6.0% |
| 6 | 1777 | 5.9% |
| 8 | 1755 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29939 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5982 | |
| 1 | 4806 | |
| _ | 4277 | |
| 2 | 2434 | |
| 3 | 2072 | 6.9% |
| 5 | 1829 | 6.1% |
| 4 | 1808 | 6.0% |
| 7 | 1786 | 6.0% |
| 6 | 1777 | 5.9% |
| 8 | 1755 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29939 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 5982 | |
| 1 | 4806 | |
| _ | 4277 | |
| 2 | 2434 | |
| 3 | 2072 | 6.9% |
| 5 | 1829 | 6.1% |
| 4 | 1808 | 6.0% |
| 7 | 1786 | 6.0% |
| 6 | 1777 | 5.9% |
| 8 | 1755 | 5.9% |
HomePlanet
Categorical
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 87 |
| Missing (%) | 2.0% |
| Memory size | 225.9 KiB |
| Earth | |
|---|---|
| Europa | |
| Mars |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.0183771 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Earth |
|---|---|
| 2nd row | Earth |
| 3rd row | Europa |
| 4th row | Europa |
| 5th row | Earth |
Common Values
| Value | Count | Frequency (%) |
| Earth | 2263 | |
| Europa | 1002 | |
| Mars | 925 | |
| (Missing) | 87 | 2.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| earth | 2263 | |
| europa | 1002 | |
| mars | 925 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4190 | |
| r | 4190 | |
| E | 3265 | |
| t | 2263 | |
| h | 2263 | |
| u | 1002 | 4.8% |
| o | 1002 | 4.8% |
| p | 1002 | 4.8% |
| M | 925 | 4.4% |
| s | 925 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 21027 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 4190 | |
| r | 4190 | |
| E | 3265 | |
| t | 2263 | |
| h | 2263 | |
| u | 1002 | 4.8% |
| o | 1002 | 4.8% |
| p | 1002 | 4.8% |
| M | 925 | 4.4% |
| s | 925 | 4.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 21027 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 4190 | |
| r | 4190 | |
| E | 3265 | |
| t | 2263 | |
| h | 2263 | |
| u | 1002 | 4.8% |
| o | 1002 | 4.8% |
| p | 1002 | 4.8% |
| M | 925 | 4.4% |
| s | 925 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 21027 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 4190 | |
| r | 4190 | |
| E | 3265 | |
| t | 2263 | |
| h | 2263 | |
| u | 1002 | 4.8% |
| o | 1002 | 4.8% |
| p | 1002 | 4.8% |
| M | 925 | 4.4% |
| s | 925 | 4.4% |
CryoSleep
Boolean
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 93 |
| Missing (%) | 2.2% |
| Memory size | 150.1 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 93 |
| Value | Count | Frequency (%) |
| False | 2640 | |
| True | 1544 | |
| (Missing) | 93 | 2.2% |
Cabin
Text
Missing 
| Distinct | 3265 |
|---|---|
| Distinct (%) | 78.2% |
| Missing | 100 |
| Missing (%) | 2.3% |
| Memory size | 232.0 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.0813981 |
| Min length | 5 |
Unique
| Unique | 2714 ? |
|---|---|
| Unique (%) | 65.0% |
Sample
| 1st row | G/3/S |
|---|---|
| 2nd row | F/4/S |
| 3rd row | C/0/S |
| 4th row | C/1/S |
| 5th row | F/5/S |
| Value | Count | Frequency (%) |
| g/160/p | 8 | 0.2% |
| d/273/s | 7 | 0.2% |
| g/748/s | 7 | 0.2% |
| e/228/s | 7 | 0.2% |
| b/31/p | 7 | 0.2% |
| f/579/p | 6 | 0.1% |
| c/177/s | 6 | 0.1% |
| c/295/p | 6 | 0.1% |
| b/214/p | 6 | 0.1% |
| g/1052/p | 6 | 0.1% |
| Other values (3255) | 4111 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 8354 | |
| 1 | 2598 | 8.8% |
| S | 2093 | 7.1% |
| P | 2084 | 7.0% |
| 2 | 1549 | 5.2% |
| F | 1445 | 4.9% |
| 4 | 1279 | 4.3% |
| 3 | 1264 | 4.3% |
| G | 1222 | 4.1% |
| 5 | 1110 | 3.8% |
| Other values (11) | 6581 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29579 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| / | 8354 | |
| 1 | 2598 | 8.8% |
| S | 2093 | 7.1% |
| P | 2084 | 7.0% |
| 2 | 1549 | 5.2% |
| F | 1445 | 4.9% |
| 4 | 1279 | 4.3% |
| 3 | 1264 | 4.3% |
| G | 1222 | 4.1% |
| 5 | 1110 | 3.8% |
| Other values (11) | 6581 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29579 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| / | 8354 | |
| 1 | 2598 | 8.8% |
| S | 2093 | 7.1% |
| P | 2084 | 7.0% |
| 2 | 1549 | 5.2% |
| F | 1445 | 4.9% |
| 4 | 1279 | 4.3% |
| 3 | 1264 | 4.3% |
| G | 1222 | 4.1% |
| 5 | 1110 | 3.8% |
| Other values (11) | 6581 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29579 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| / | 8354 | |
| 1 | 2598 | 8.8% |
| S | 2093 | 7.1% |
| P | 2084 | 7.0% |
| 2 | 1549 | 5.2% |
| F | 1445 | 4.9% |
| 4 | 1279 | 4.3% |
| 3 | 1264 | 4.3% |
| G | 1222 | 4.1% |
| 5 | 1110 | 3.8% |
| Other values (11) | 6581 |
Destination
Categorical
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 92 |
| Missing (%) | 2.2% |
| Memory size | 251.1 KiB |
| TRAPPIST-1e | |
|---|---|
| 55 Cancri e | |
| PSO J318.5-22 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 11.185424 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TRAPPIST-1e |
|---|---|
| 2nd row | TRAPPIST-1e |
| 3rd row | 55 Cancri e |
| 4th row | TRAPPIST-1e |
| 5th row | TRAPPIST-1e |
Common Values
| Value | Count | Frequency (%) |
| TRAPPIST-1e | 2956 | |
| 55 Cancri e | 841 | 19.7% |
| PSO J318.5-22 | 388 | 9.1% |
| (Missing) | 92 | 2.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| trappist-1e | 2956 | |
| 55 | 841 | 13.4% |
| cancri | 841 | 13.4% |
| e | 841 | 13.4% |
| pso | 388 | 6.2% |
| j318.5-22 | 388 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 6300 | |
| T | 5912 | |
| e | 3797 | 8.1% |
| - | 3344 | 7.1% |
| S | 3344 | 7.1% |
| 1 | 3344 | 7.1% |
| R | 2956 | 6.3% |
| I | 2956 | 6.3% |
| A | 2956 | 6.3% |
| 5 | 2070 | 4.4% |
| Other values (13) | 9832 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 46811 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| P | 6300 | |
| T | 5912 | |
| e | 3797 | 8.1% |
| - | 3344 | 7.1% |
| S | 3344 | 7.1% |
| 1 | 3344 | 7.1% |
| R | 2956 | 6.3% |
| I | 2956 | 6.3% |
| A | 2956 | 6.3% |
| 5 | 2070 | 4.4% |
| Other values (13) | 9832 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 46811 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| P | 6300 | |
| T | 5912 | |
| e | 3797 | 8.1% |
| - | 3344 | 7.1% |
| S | 3344 | 7.1% |
| 1 | 3344 | 7.1% |
| R | 2956 | 6.3% |
| I | 2956 | 6.3% |
| A | 2956 | 6.3% |
| 5 | 2070 | 4.4% |
| Other values (13) | 9832 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 46811 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| P | 6300 | |
| T | 5912 | |
| e | 3797 | 8.1% |
| - | 3344 | 7.1% |
| S | 3344 | 7.1% |
| 1 | 3344 | 7.1% |
| R | 2956 | 6.3% |
| I | 2956 | 6.3% |
| A | 2956 | 6.3% |
| 5 | 2070 | 4.4% |
| Other values (13) | 9832 |
Age
Real number (ℝ)
Missing  Zeros 
| Distinct | 79 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 91 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.658146 |
| Minimum | 0 |
|---|---|
| Maximum | 79 |
| Zeros | 82 |
| Zeros (%) | 1.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 19 |
| median | 26 |
| Q3 | 37 |
| 95-th percentile | 55 |
| Maximum | 79 |
| Range | 79 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 14.179072 |
|---|---|
| Coefficient of variation (CV) | 0.49476583 |
| Kurtosis | 0.21852293 |
| Mean | 28.658146 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.48480029 |
| Sum | 119963 |
| Variance | 201.04607 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 176 | 4.1% |
| 22 | 163 | 3.8% |
| 19 | 162 | 3.8% |
| 20 | 160 | 3.7% |
| 24 | 158 | 3.7% |
| 21 | 157 | 3.7% |
| 25 | 156 | 3.6% |
| 23 | 144 | 3.4% |
| 26 | 132 | 3.1% |
| 27 | 127 | 3.0% |
| Other values (69) | 2651 |
| Value | Count | Frequency (%) |
| 0 | 82 | |
| 1 | 27 | 0.6% |
| 2 | 35 | |
| 3 | 34 | |
| 4 | 20 | 0.5% |
| 5 | 20 | 0.5% |
| 6 | 25 | 0.6% |
| 7 | 13 | 0.3% |
| 8 | 24 | 0.6% |
| 9 | 21 | 0.5% |
| Value | Count | Frequency (%) |
| 79 | 2 | < 0.1% |
| 78 | 1 | < 0.1% |
| 77 | 1 | < 0.1% |
| 75 | 2 | < 0.1% |
| 74 | 2 | < 0.1% |
| 73 | 5 | |
| 72 | 3 | |
| 71 | 2 | < 0.1% |
| 70 | 2 | < 0.1% |
| 69 | 6 |
VIP
Boolean
Imbalance  Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 93 |
| Missing (%) | 2.2% |
| Memory size | 150.1 KiB |
| False | |
|---|---|
| True | 74 |
| (Missing) | 93 |
| Value | Count | Frequency (%) |
| False | 4110 | |
| True | 74 | 1.7% |
| (Missing) | 93 | 2.2% |
RoomService
Real number (ℝ)
Missing  Zeros 
| Distinct | 842 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 82 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 219.26627 |
| Minimum | 0 |
|---|---|
| Maximum | 11567 |
| Zeros | 2726 |
| Zeros (%) | 63.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 53 |
| 95-th percentile | 1274.5 |
| Maximum | 11567 |
| Range | 11567 |
| Interquartile range (IQR) | 53 |
Descriptive statistics
| Standard deviation | 607.01129 |
|---|---|
| Coefficient of variation (CV) | 2.7683751 |
| Kurtosis | 53.216268 |
| Mean | 219.26627 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.5583897 |
| Sum | 919822 |
| Variance | 368462.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2726 | |
| 1 | 68 | 1.6% |
| 2 | 34 | 0.8% |
| 3 | 28 | 0.7% |
| 4 | 24 | 0.6% |
| 6 | 16 | 0.4% |
| 5 | 15 | 0.4% |
| 9 | 13 | 0.3% |
| 8 | 12 | 0.3% |
| 13 | 11 | 0.3% |
| Other values (832) | 1248 | |
| (Missing) | 82 | 1.9% |
| Value | Count | Frequency (%) |
| 0 | 2726 | |
| 1 | 68 | 1.6% |
| 2 | 34 | 0.8% |
| 3 | 28 | 0.7% |
| 4 | 24 | 0.6% |
| 5 | 15 | 0.4% |
| 6 | 16 | 0.4% |
| 7 | 8 | 0.2% |
| 8 | 12 | 0.3% |
| 9 | 13 | 0.3% |
| Value | Count | Frequency (%) |
| 11567 | 1 | |
| 7407 | 1 | |
| 6438 | 1 | |
| 5900 | 1 | |
| 5862 | 1 | |
| 5454 | 1 | |
| 5333 | 1 | |
| 5100 | 1 | |
| 4922 | 1 | |
| 4908 | 1 |
FoodCourt
Real number (ℝ)
High correlation  Missing  Zeros 
| Distinct | 902 |
|---|---|
| Distinct (%) | 21.6% |
| Missing | 106 |
| Missing (%) | 2.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 439.4843 |
| Minimum | 0 |
|---|---|
| Maximum | 25273 |
| Zeros | 2690 |
| Zeros (%) | 62.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 78 |
| 95-th percentile | 2518.5 |
| Maximum | 25273 |
| Range | 25273 |
| Interquartile range (IQR) | 78 |
Descriptive statistics
| Standard deviation | 1527.663 |
|---|---|
| Coefficient of variation (CV) | 3.4760356 |
| Kurtosis | 67.764434 |
| Mean | 439.4843 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.9106254 |
| Sum | 1833089 |
| Variance | 2333754.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2690 | |
| 1 | 59 | 1.4% |
| 2 | 30 | 0.7% |
| 4 | 22 | 0.5% |
| 3 | 21 | 0.5% |
| 6 | 20 | 0.5% |
| 5 | 19 | 0.4% |
| 7 | 13 | 0.3% |
| 11 | 12 | 0.3% |
| 10 | 12 | 0.3% |
| Other values (892) | 1273 | |
| (Missing) | 106 | 2.5% |
| Value | Count | Frequency (%) |
| 0 | 2690 | |
| 1 | 59 | 1.4% |
| 2 | 30 | 0.7% |
| 3 | 21 | 0.5% |
| 4 | 22 | 0.5% |
| 5 | 19 | 0.4% |
| 6 | 20 | 0.5% |
| 7 | 13 | 0.3% |
| 8 | 11 | 0.3% |
| 9 | 8 | 0.2% |
| Value | Count | Frequency (%) |
| 25273 | 1 | |
| 23397 | 1 | |
| 20809 | 1 | |
| 20229 | 1 | |
| 16963 | 1 | |
| 16954 | 1 | |
| 16250 | 1 | |
| 16071 | 1 | |
| 12350 | 1 | |
| 11984 | 1 |
ShoppingMall
Real number (ℝ)
Missing  Zeros 
| Distinct | 715 |
|---|---|
| Distinct (%) | 17.1% |
| Missing | 98 |
| Missing (%) | 2.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 177.29553 |
| Minimum | 0 |
|---|---|
| Maximum | 8292 |
| Zeros | 2744 |
| Zeros (%) | 64.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 33 |
| 95-th percentile | 994.1 |
| Maximum | 8292 |
| Range | 8292 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 560.82112 |
|---|---|
| Coefficient of variation (CV) | 3.1631995 |
| Kurtosis | 68.221142 |
| Mean | 177.29553 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.8249391 |
| Sum | 740918 |
| Variance | 314520.33 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2744 | |
| 1 | 72 | 1.7% |
| 3 | 35 | 0.8% |
| 2 | 32 | 0.7% |
| 4 | 24 | 0.6% |
| 7 | 19 | 0.4% |
| 9 | 17 | 0.4% |
| 8 | 16 | 0.4% |
| 12 | 13 | 0.3% |
| 10 | 12 | 0.3% |
| Other values (705) | 1195 | |
| (Missing) | 98 | 2.3% |
| Value | Count | Frequency (%) |
| 0 | 2744 | |
| 1 | 72 | 1.7% |
| 2 | 32 | 0.7% |
| 3 | 35 | 0.8% |
| 4 | 24 | 0.6% |
| 5 | 11 | 0.3% |
| 6 | 12 | 0.3% |
| 7 | 19 | 0.4% |
| 8 | 16 | 0.4% |
| 9 | 17 | 0.4% |
| Value | Count | Frequency (%) |
| 8292 | 1 | |
| 8251 | 1 | |
| 8098 | 1 | |
| 8017 | 1 | |
| 7022 | 1 | |
| 6252 | 1 | |
| 6108 | 1 | |
| 6061 | 1 | |
| 6023 | 1 | |
| 5649 | 1 |
Spa
Real number (ℝ)
Missing  Zeros 
| Distinct | 833 |
|---|---|
| Distinct (%) | 19.9% |
| Missing | 101 |
| Missing (%) | 2.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 303.05244 |
| Minimum | 0 |
|---|---|
| Maximum | 19844 |
| Zeros | 2611 |
| Zeros (%) | 61.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 50 |
| 95-th percentile | 1525 |
| Maximum | 19844 |
| Range | 19844 |
| Interquartile range (IQR) | 50 |
Descriptive statistics
| Standard deviation | 1117.186 |
|---|---|
| Coefficient of variation (CV) | 3.6864445 |
| Kurtosis | 80.460402 |
| Mean | 303.05244 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.6902979 |
| Sum | 1265547 |
| Variance | 1248104.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2611 | |
| 1 | 72 | 1.7% |
| 2 | 43 | 1.0% |
| 3 | 29 | 0.7% |
| 4 | 27 | 0.6% |
| 6 | 23 | 0.5% |
| 8 | 22 | 0.5% |
| 7 | 19 | 0.4% |
| 5 | 16 | 0.4% |
| 9 | 16 | 0.4% |
| Other values (823) | 1298 | |
| (Missing) | 101 | 2.4% |
| Value | Count | Frequency (%) |
| 0 | 2611 | |
| 1 | 72 | 1.7% |
| 2 | 43 | 1.0% |
| 3 | 29 | 0.7% |
| 4 | 27 | 0.6% |
| 5 | 16 | 0.4% |
| 6 | 23 | 0.5% |
| 7 | 19 | 0.4% |
| 8 | 22 | 0.5% |
| 9 | 16 | 0.4% |
| Value | Count | Frequency (%) |
| 19844 | 1 | |
| 15733 | 1 | |
| 15255 | 1 | |
| 14252 | 1 | |
| 13983 | 1 | |
| 12842 | 1 | |
| 12767 | 1 | |
| 12690 | 1 | |
| 12437 | 1 | |
| 11483 | 1 |
VRDeck
Real number (ℝ)
High correlation  Missing  Zeros 
| Distinct | 796 |
|---|---|
| Distinct (%) | 19.0% |
| Missing | 80 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 310.71003 |
| Minimum | 0 |
|---|---|
| Maximum | 22272 |
| Zeros | 2757 |
| Zeros (%) | 64.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 36 |
| 95-th percentile | 1536.8 |
| Maximum | 22272 |
| Range | 22272 |
| Interquartile range (IQR) | 36 |
Descriptive statistics
| Standard deviation | 1246.9947 |
|---|---|
| Coefficient of variation (CV) | 4.0133714 |
| Kurtosis | 93.842398 |
| Mean | 310.71003 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.38721 |
| Sum | 1304050 |
| Variance | 1554995.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2757 | |
| 1 | 72 | 1.7% |
| 2 | 38 | 0.9% |
| 3 | 33 | 0.8% |
| 7 | 23 | 0.5% |
| 6 | 21 | 0.5% |
| 4 | 20 | 0.5% |
| 5 | 17 | 0.4% |
| 32 | 10 | 0.2% |
| 8 | 10 | 0.2% |
| Other values (786) | 1196 | |
| (Missing) | 80 | 1.9% |
| Value | Count | Frequency (%) |
| 0 | 2757 | |
| 1 | 72 | 1.7% |
| 2 | 38 | 0.9% |
| 3 | 33 | 0.8% |
| 4 | 20 | 0.5% |
| 5 | 17 | 0.4% |
| 6 | 21 | 0.5% |
| 7 | 23 | 0.5% |
| 8 | 10 | 0.2% |
| 9 | 9 | 0.2% |
| Value | Count | Frequency (%) |
| 22272 | 1 | |
| 19086 | 1 | |
| 18670 | 1 | |
| 16514 | 1 | |
| 15940 | 1 | |
| 15125 | 1 | |
| 14834 | 1 | |
| 14587 | 1 | |
| 14268 | 1 | |
| 12863 | 1 |
Name
Text
Missing 
| Distinct | 4176 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 94 |
| Missing (%) | 2.2% |
| Memory size | 260.6 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 13.756634 |
| Min length | 7 |
Unique
| Unique | 4169 ? |
|---|---|
| Unique (%) | 99.7% |
Sample
| 1st row | Nelly Carsoning |
|---|---|
| 2nd row | Lerome Peckers |
| 3rd row | Sabih Unhearfus |
| 4th row | Meratz Caltilter |
| 5th row | Brence Harperez |
| Value | Count | Frequency (%) |
| extraly | 14 | 0.2% |
| hopperett | 13 | 0.2% |
| tranklinay | 11 | 0.1% |
| apple | 10 | 0.1% |
| dickley | 10 | 0.1% |
| garrez | 10 | 0.1% |
| pughlinsons | 9 | 0.1% |
| logannon | 9 | 0.1% |
| petton | 9 | 0.1% |
| emenez | 9 | 0.1% |
| Other values (3821) | 8262 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6188 | 10.8% |
| a | 5010 | 8.7% |
| n | 4535 | 7.9% |
| 4183 | 7.3% | |
| r | 3692 | 6.4% |
| o | 3225 | 5.6% |
| l | 3097 | 5.4% |
| i | 3011 | 5.2% |
| s | 2657 | 4.6% |
| t | 2246 | 3.9% |
| Other values (43) | 19700 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 57544 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 6188 | 10.8% |
| a | 5010 | 8.7% |
| n | 4535 | 7.9% |
| 4183 | 7.3% | |
| r | 3692 | 6.4% |
| o | 3225 | 5.6% |
| l | 3097 | 5.4% |
| i | 3011 | 5.2% |
| s | 2657 | 4.6% |
| t | 2246 | 3.9% |
| Other values (43) | 19700 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 57544 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 6188 | 10.8% |
| a | 5010 | 8.7% |
| n | 4535 | 7.9% |
| 4183 | 7.3% | |
| r | 3692 | 6.4% |
| o | 3225 | 5.6% |
| l | 3097 | 5.4% |
| i | 3011 | 5.2% |
| s | 2657 | 4.6% |
| t | 2246 | 3.9% |
| Other values (43) | 19700 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 57544 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 6188 | 10.8% |
| a | 5010 | 8.7% |
| n | 4535 | 7.9% |
| 4183 | 7.3% | |
| r | 3692 | 6.4% |
| o | 3225 | 5.6% |
| l | 3097 | 5.4% |
| i | 3011 | 5.2% |
| s | 2657 | 4.6% |
| t | 2246 | 3.9% |
| Other values (43) | 19700 |
Interactions
Correlations
| Age | CryoSleep | Destination | FoodCourt | HomePlanet | RoomService | ShoppingMall | Spa | VIP | VRDeck | |
|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.093 | 0.039 | 0.172 | 0.207 | 0.077 | 0.067 | 0.182 | 0.096 | 0.155 |
| CryoSleep | 0.093 | 1.000 | 0.126 | 0.171 | 0.140 | 0.190 | 0.205 | 0.142 | 0.071 | 0.138 |
| Destination | 0.039 | 0.126 | 1.000 | 0.093 | 0.279 | 0.057 | 0.050 | 0.085 | 0.018 | 0.070 |
| FoodCourt | 0.172 | 0.171 | 0.093 | 1.000 | 0.262 | 0.185 | 0.190 | 0.457 | 0.114 | 0.506 |
| HomePlanet | 0.207 | 0.140 | 0.279 | 0.262 | 1.000 | 0.195 | 0.132 | 0.204 | 0.152 | 0.204 |
| RoomService | 0.077 | 0.190 | 0.057 | 0.185 | 0.195 | 1.000 | 0.446 | 0.249 | 0.059 | 0.198 |
| ShoppingMall | 0.067 | 0.205 | 0.050 | 0.190 | 0.132 | 0.446 | 1.000 | 0.272 | 0.036 | 0.192 |
| Spa | 0.182 | 0.142 | 0.085 | 0.457 | 0.204 | 0.249 | 0.272 | 1.000 | 0.166 | 0.451 |
| VIP | 0.096 | 0.071 | 0.018 | 0.114 | 0.152 | 0.059 | 0.036 | 0.166 | 1.000 | 0.114 |
| VRDeck | 0.155 | 0.138 | 0.070 | 0.506 | 0.204 | 0.198 | 0.192 | 0.451 | 0.114 | 1.000 |
Missing values
Sample
| PassengerId | HomePlanet | CryoSleep | Cabin | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Name | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0013_01 | Earth | True | G/3/S | TRAPPIST-1e | 27.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Nelly Carsoning |
| 1 | 0018_01 | Earth | False | F/4/S | TRAPPIST-1e | 19.0 | False | 0.0 | 9.0 | 0.0 | 2823.0 | 0.0 | Lerome Peckers |
| 2 | 0019_01 | Europa | True | C/0/S | 55 Cancri e | 31.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Sabih Unhearfus |
| 3 | 0021_01 | Europa | False | C/1/S | TRAPPIST-1e | 38.0 | False | 0.0 | 6652.0 | 0.0 | 181.0 | 585.0 | Meratz Caltilter |
| 4 | 0023_01 | Earth | False | F/5/S | TRAPPIST-1e | 20.0 | False | 10.0 | 0.0 | 635.0 | 0.0 | 0.0 | Brence Harperez |
| 5 | 0027_01 | Earth | False | F/7/P | TRAPPIST-1e | 31.0 | False | 0.0 | 1615.0 | 263.0 | 113.0 | 60.0 | Karlen Ricks |
| 6 | 0029_01 | Europa | True | B/2/P | 55 Cancri e | 21.0 | False | 0.0 | NaN | 0.0 | 0.0 | 0.0 | Aldah Ainserfle |
| 7 | 0032_01 | Europa | True | D/0/S | TRAPPIST-1e | 20.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Acrabi Pringry |
| 8 | 0032_02 | Europa | True | D/0/S | 55 Cancri e | 23.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Dhena Pringry |
| 9 | 0033_01 | Earth | False | F/7/S | 55 Cancri e | 24.0 | False | 0.0 | 639.0 | 0.0 | 0.0 | 0.0 | Eliana Delazarson |
| PassengerId | HomePlanet | CryoSleep | Cabin | Destination | Age | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Name | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4267 | 9260_01 | Earth | True | G/1503/P | 55 Cancri e | 3.0 | NaN | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Luisy Portananney |
| 4268 | 9262_01 | Earth | False | F/1795/S | 55 Cancri e | 20.0 | False | 0.0 | 601.0 | 103.0 | 35.0 | 0.0 | Sonald Hurchrisong |
| 4269 | 9263_01 | Earth | True | G/1495/S | TRAPPIST-1e | 43.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Loisey Heney |
| 4270 | 9265_01 | Mars | False | D/278/S | TRAPPIST-1e | 43.0 | False | 47.0 | 0.0 | 3851.0 | 0.0 | 0.0 | Toate Cure |
| 4271 | 9266_01 | Earth | False | F/1796/S | TRAPPIST-1e | 40.0 | False | 0.0 | 865.0 | 0.0 | 3.0 | 0.0 | Danna Peter |
| 4272 | 9266_02 | Earth | True | G/1496/S | TRAPPIST-1e | 34.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Jeron Peter |
| 4273 | 9269_01 | Earth | False | NaN | TRAPPIST-1e | 42.0 | False | 0.0 | 847.0 | 17.0 | 10.0 | 144.0 | Matty Scheron |
| 4274 | 9271_01 | Mars | True | D/296/P | 55 Cancri e | NaN | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Jayrin Pore |
| 4275 | 9273_01 | Europa | False | D/297/P | NaN | NaN | False | 0.0 | 2680.0 | 0.0 | 0.0 | 523.0 | Kitakan Conale |
| 4276 | 9277_01 | Earth | True | G/1498/S | PSO J318.5-22 | 43.0 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Lilace Leonzaley |